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METHOD FOR EXPRESSION OF SMALL RNA MOLECULES WITHIN A CELL 

Government Support 

[0001] This invention was made with government support under Grant Number 
GM39458 awarded by the National Institutes of Health. The United States Government has 
certain rights in the invention. 

Background of the Invention 

Field of the Invention 

[0002] The present invention relates generally to methods for altering gene 
expression in a cell or an animal using viral constructs engineered to deliver an RNA 
molecule. In a more specific aspect, a viral construct is used to deliver double-stranded 
RNA molecules that can be used to down-regulate or modulate gene expression. 

Description of the Related Art 

[0003] RNA interference (RNAi) or silencing is a recently discovered 
phenomenon (A. Fire et al y Nature 391, 806 (1998); C.E. Rocheleau et al Cell 90, 707 
(1997)). Small interfering RNAs ("siRNAs") are double-stranded RNA molecules that 
inhibit the expression of a gene with which they share homology. siRNAs have been used 
as a tool to down regulate the expression of specific genes in a variety of cultured cells as 
well as in invertebrate animals. A number of such approaches have been reviewed recently 
(P.D. Zamore Science 296, 1265 (2002)); however, such approaches have limitations. For 
example, no technique prior to the invention described herein allows for the generation of 
transgenic mammal having a specific gene down regulated through RNA interference. 
Similarly, there is a need for more robust methods for the introduction of small RNA 
molecules with regulatory function. The invention provided herein addresses these and 
other limitations in the field of RNA mediated gene regulation. 

Summary of the Invention 
[0004] The invention relates generally to methods to express within a cell an 
RNA molecule or molecules. These methods can be used with a wide variety of cell types. 
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RNA molecules can be expressed within a cell for a variety of purposes. For example, and 
without limitation, RNA molecules can serve as markers within a cell, can act as antisense 
oligonucleotides or ribozymes for regulating gene expression, and can serve to down 
regulate genes through RNA interference. 

[0005] In one aspect, the invention provides retroviral constructs for the 
expression of an RNA molecule or molecules within a cell. The constructs preferably 
comprise a nucleic acid having the R and U5 sequences from a 5' lentiviral long terminal 
repeat (LTR), a self-inactivating lentiviral 3' LTR, and a RNA Polymerase III (pol HI) 
promoter. The retroviral constructs preferably comprise an RNA coding region operably 
linked to the RNA Polymerase m promoter. The RNA coding region preferably comprises 
a DNA sequence that can serve as a template for the expression of a desired RNA molecule. 

[0006] The RNA coding region can be immediately followed by a pol III 
terminator sequence which directs the accurate and efficient termination of RNA synthesis 
by pol III. The pol in terminator sequences generally comprise 4 or more consecutive T 
residues. In a preferred embodiment, a cluster of 5 consecutive Ts is used as the terminator 
by which pol HI transcription is stopped at second or third T of the DNA template. As a 
result, only 2 to 3 U residues are added to the 3 1 end of the RNA that is synthesized from 
the RNA coding region. 

[0007] A variety of pol III promoters can be used with the invention, including 
for example, the promoter fragments derived from HI RNA genes or U6 sn RNA genes of 
human or mouse origin or from any other species. In addition, pol HI promoters can be 
modified/engineered to incorporate other desirable properties such as to be inducible by 
small chemical molecules either ubiquitously or in a tissue-specific manner, for example, 
one activated with tetracycline or IPTG (lad system). 

[0008] The pol m promoter, RNA template region and pol HI terminator 
together may comprise an "RNA cassette" or "RNA expression cassette." If the RNA is a 
small inhibitory RNA (siRNA), the expression cassette may be termed an "siRNA 
expression cassette." 

[0009] In one embodiment, the RNA coding region encodes a self- 
complementary RNA molecule having a sense region, an antisense region and a loop 
region. Such an RNA molecule when expressed desirably forms a "hairpin" structure. The 
loop region is generally between about 2 and about 10 nucleotides in length. In a preferred 

2 



WO 03/022052 



PCT7US02/29215 



embodiment, the loop region is from about 6 and about 9 nucleotides in length. In one such 
embodiment of the invention, the sense region and the antisense region are between about 
15 and about 30 nucleotides in length. 

[0010] In one embodiment, the RNA coding region is operably linked 
downstream to an RNA Polymerase EI promoter such that the RNA coding sequence can be 
precisely expressed without any extra non-coding nucleotides present at 5' end. In this way 
an RNA sequence can be expressed that is identical to a target sequence at the 5' end. The 
synthesis of the RNA coding region is ended at the terminator site. In one preferred 
embodiment the terminator consists of five consecutive T residues. 

[0011] In another aspect of the invention, the retroviral vector can comprise 
multiple RNA coding regions. In one embodiment, the retroviral construct comprises a first 
RNA pol IE promoter, a first coding region encoding a first RNA molecule operably linked 
to the first RNA pol m promoter, a second RNA pol IH promoter and a second RNA coding 
region operably linked to the second RNA pol m promoter. Preferably, the second RNA 
coding region encodes an RNA molecule that is substantially complementary to the RNA 
molecule encoded by the first RNA coding region, such that the two RNA molecules can 
form a double-stranded structure when expressed. The methods of invention also include 
multiple RNA coding regions that encode hairpin-like self-complementary RNA molecules 
or other non-hairpin molecules. 

[0012] In yet another embodiment of the invention, the retroviral construct 
comprises a first RNA pol HI promoter operably linked to a first RNA coding region, and a 
second RNA pol IH promoter operably linked to the same first RNA coding region in the 
opposite direction, such that expression of the RNA coding region from the first RNA pol 
HI promoter results in a synthesis of a first RNA molecule as the sense strand and 
expression of the RNA coding region from the second RNA pol III promoter results in 
synthesis of a second RNA molecule as an antisense strand that is substantially 
complementary to the first RNA molecule. In one such embodiment, both RNA 
Polymerase m promoters are separated from the RNA coding region by termination 
sequences, preferably termination sequences having five consecutive T residues. 

[0013] According to one embodiment of the invention, the 5' LTR sequences in 
the retroviral construct are derived from HIV. The retroviral construct may also comprise a 
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woodchuck hepatitis virus enhancer element sequence and/or a tRNA amber suppressor 
sequence. 

[0014] In another embodiment of the invention, the self-inactivating 3' LTR is a 
U3 element with a deletion of its enhancer sequence. In yet another embodiment, the self- 
inactivating 3' LTR is a modified HIV 3' LTR. 

[0015] The recombinant retroviral construct can be pseudotyped, for example 
with the vesicular stomatitits virus envelope glycoprotein. 

[0016] In another aspect of the invention, expression of the RNA coding region 
results in the down regulation of a target gene. Preferably the target gene comprises a 
sequence that is at least about 90% identical with the RNA coding region, more preferably 
at least about 95% identical, and even more preferably at least about 99% identical. 

[0017] According to a further aspect of the invention, the viral construct also 
comprises a nucleotide sequence encoding a gene of interest. The gene of interest is 
preferably operably linked to a Polymerase II promoter. Such a construct also can contain, 
for example, an enhancer sequence operably linked with the Polymerase II promoter. 

[0018] A variety of Polymerase II promoters can be used with the invention, 
including for example, the CMV promoter. The RNA Polymerase II promoter that is 
chosen can be a ubiquitous promoter, capable of driving expression in most tissues, for 
example, the human Ubiquitin-C promoter, CMV p-actin promoter or PGK promoter. In 
other embodiments the RNA Polymerase II promoter is a tissue-specific promoter. 

[0019] In one embodiment, the gene of interest is a marker or reporter gene, that 
can be used to verify that the vector was successfully transfected or transduced and its 
sequences expressed. In one such embodiment, the gene of interest is a fluorescent reporter 
gene, for example, the Green Fluorescent Protein. In yet another embodiment, the gene of 
interest is a drug resistant gene which can be used to select the cells that are successfully 
transduced. For example, the drug resistant gene can be the zeocin resistant gene (zeo). 
The gene of interest also can be a hybrid of a drug resistant gene and a fluorescent reporter 
gene, such as a zeo/gfp fusion. In another embodiment, the gene of interest encodes a 
protein factor that can regulate the transcription activity of inducible pol HI promoters. In 
one of such embodiment, the gene of interest is tetR (repressor for tet operon) which 
regulates tetracycline responsive pol III promoters. 
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[0020] It is another aspect of the invention to provide methods for expressing an 
RNA molecule or molecules within a ceil According to the invention, a packaging ceii line 
is transfected with a retroviral construct of the invention, recombinant retroviral particles 
are recovered from the packaging cell line; and a target cell is infected with the 
recombinant retrovirus particles. 

[0021] In one embodiment of the invention, the target cell is an embryonic cell. 
An embryonic cell may be, for example, a single cell embryo or embryonic cells from 
within an early-stage embryo, hi another embodiment of the invention, the target cell is an 
embryogenic stem cell. When the target cell is an embryonic cell, in one embodiment the 
embryonic cell is infected by injecting the recombinant retrovirus between the zona 
pellucida and the cell membrane of the embryonic cell. In another embodiment, the 
embryonic cell is infected by removing the zona pellucida and incubating the cell in 
solution containing the recombinant retrovirus. In such an embodiment, the zona pellucida 
can be removed, for example, by enzymatic digestion. 

[0022] When the target cell is an embryonic cell or an embryogenic stem cell, 
the cell may be transplanted in a pseudopregnant female to generate a transgenic animal 

[0023] The methods of the invention also can be used with a variety of primary 
ex vivo normal or diseased cells or cells adapted in various tissue culture conditions from 
human, mouse and other vertebrates, including, without limitation, stem or precursor cells 
for the hematopoietic system, central nerve system cells, cells with regenerative capacities 
from a variety of other tissues and organs, dendritic cells and other developing and mature 
myeloid and lymphoid cells, and cancer cells derived from different cell lineages. 

[0024] In a particular embodiment, the target cell is an embryonic cell of a bird 
within an egg. The embryonic cell of a bird is preferably infected by contacting the 
embryonic blastodisc of the bird egg with retroviral particles. 

[0025] In yet another embodiment, the target cell is a fish egg. The fish egg is 
preferably infected by delivering the retroviral particles to the space between the chorion 
and the cell membrane of the fish egg. 

Brief Description of the Drawings 

[0026] Figure 1 A shows a schematic diagram of a retroviral vector carrying an 
expression cassette for RNA expression, termed "RNA cassette" and a "Marker Gene" or 
gene of interest. The RNA expression cassette can be embedded at any permissible sites of 
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the retroviral construct either as single copy or multiple tandem copies. In addition, 
although not indicated in the figure, more than one RNA expression cassette may be present 
in the retroviral construct. Figure IB shows a similar construct in which the RNA 
expression cassettes flank a marker gene. 

[0027] Figure 2 shows a schematic view of an RNA expression cassette 
comprising an RNA Polymerase in promoter 100 linked to an RNA coding region 110-130 
and a terminator sequence 140. The RNA coding region comprises a sense region 110, a 
loop region 120, and an antisense region 130, 

[0028] Figure 3 shows a schematic view of an RNA expression cassette having 
an RNA Polymerase IE promoter 100 linked to a first RNA coding region 110 and a first 
terminator sequence 140 and a second RNA polymerase HI promoter 105 linked to a second 
RNA coding region 115 and a second terminator 145, 

[0029] Figure 4 shows a schematic view of an RNA expression cassette having 
a first RNA Polymerase m promoter 100 linked to an RNA coding region 110 and a first 
terminator sequence 145. The expression cassette has a second RNA polymerase HI 
promoter 105 linked to the RNA coding region 115, the same sequence as 110 in reverse, 
and a second terminator 140. 

[0030] Figure 5. Schematic illustration of a lacZ siRNA encoding lentiviral 
vector. 5 'LTR: an HIV based lentiviral vector 5' LTR; F: an HIV Flap element; pol EQ: a 
human HI -RNA pol HI promoter (-240 to -8); siRNA: a lacZ specific small hairpin RNA 
coding region and its structure and detailed sequence are illustrated below. UbiC: an 
internal human ubiquitinC promoter; GFP: a GFP marker gene driven by UbiC promoter. 
W: a woodchuck RNA regulatory element. 3'LTR: an HIV based self inactivating 
lentiviral 3* LTR. 

[0031] Figure 6. A lacZ specific siRNA encoded by a lentiviral vector can 
efficiently inhibit the expression of lacZ reporter gene in virus transduced mammalian cells. 
MEF: mouse embryonic fibroblasts; HEK293: human embryonic kidney cells. Both of the 
test cell lines harbor lacZ and firefly luciferase reporter genes, and the expression levels of 
the reporter genes can be measured by chemiluminescent assays. Ctrl: the ratio of lacZ 
activity versus Luc activity of the uninfected parental cells, which was arbitrarily set to 1. 
Transduced: the specific inhibition of lacZ expression calculated as the reduction of lacZ to 
Luc ratio. 
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[0032] Figure 7. Transgenic animals that express a lacZ specific siRNA 
molecule encoded by a lentiviral vector can successfully suppress the expression of the 
ubiquitous lacZ reporter gene in a ROSA26+/- background. ROSA1-6: the lacZ activities 
in the limb tissues of six El 7.5 ROSA26+/- embryos which served as positive controls. 
The difference in lacZ activity between individual ROSA26+/- embryos may result from 
variable protein extraction efficiency. TG1-4: the lacZ activities in the limb tissues of four 
El 7.5 transgenic embryos expressing a lentiviral vector-encoded lacZ siRNA molecule in 
ROSA+A background. WT1-6: lacZ activity in the limb tissues of six E17.5 C57B1/6 
wildtype embryos, included as the negative control. The background levels of endogenous 
beta-galactosidase activity are general below 1,000 LU/ug, thus the columns are not visible. 

[0033] Figure 8 shows a schematic illustration of a Tet-inducible lacZ siRNA 
lentiviral vector. A Tet repressor gene (TetR; SEQ ID NO: 5) is the under the control 
human UbiquitinC promoter and its expression can be monitored by the downstream GFP 
marker coupled by IRES element (internal ribosomal entry site). The anti-lacZ siRNA 
cassette is driven by a Tet-inducible pol III promoter derived from human U6-promoter (- 
328 to +1) containing a single TetR binding site (TetOl) between the PSE and TATA box 
(SEQ ID NO: 4). In the absence of tetracycline, TetR binds to the promoter and its 
expression is repressed. Upon the addition of tetracycline, TetR is moved from the 
promoter and transcription starts. 

[0034] Figure 9 shows the results of an experiment that demonstrated that a Tet- 
inducible siRNA expression cassette can regulate gene expression in response to 
Doxycycline treatment. lacZ and luciferase double expressing HEK293 cells (293Z+Luc) 
were transduced with a lentiviral vector carrying a Tet-inducible lacZ-siRNA cassette and a 
Tet repressor under the control of a UbiquitinC promoter (Figure 8). The transduced cells 
were treated with 10 ug/ml Doxycycline (Plus Dox) for 48hr or without the Doxycycline 
treatment as a control (No Dox). LacZ and luciferase activities were measured as described 
in the previous figures. The relative suppression activity is calculated as the ratio of lacZ 
versus luciferase and No Dox control was arbitrarily set to 1 . 

Detailed Description of the Preferred Embodiment 
[0035] The inventors have previously identified a method for introducing a 
transgene of interest into a cell or animal. This technique is described in co-pending U.S. 
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provisional patent application number 60/322,031 filed on 9/13/2001 and co-pending U.S. 
provisional patent application number 60/347,782 filed on 1/9/2002, the entire contents of 
which are incorporated herein by reference. 

[0036] Unless defined otherwise, all technical and scientific terms used herein 
have the same meaning as commonly understood by one of ordinary skill in the art to which 
this invention belongs. Any methods, devices and materials similar or equivalent to those 
described herein can be used in the practice of this invention. 

[0037] By "transgene" is meant any nucleotide sequence, particularly a DNA 
sequence, that is integrated into one or more chromosomes of a host cell by human 
intervention, such as by the methods of the present invention. In one embodiment, a 
transgene is an "RNA coding region." In another embodiment the transgene comprises a 
"gene of interest." In other embodiments the transgene can be a nucleotide sequence, 
preferably a DNA sequence, that is used to mark the chromosome where it has integrated. 
In this situation, the transgene does not have to comprise a gene that encodes a protein that 
can be expressed. 

[0038] A "gene of interest" is a nucleic acid sequence that encodes a protein or 
other molecule that is desirable for integration in a host cell. In one embodiment, the gene 
of interest encodes a protein or other molecule the expression of which is desired in the host 
cell. In this embodiment, the gene of interest is generally operatively linked to other 
sequences that are useful for obtaining the desired expression of the gene of interest, such 
as transcriptional regulatory sequences. 

[0039] A "functional relationship" and "operably linked" mean, without 
limitation, that the gene is in the correct location and orientation with respect to the 
promoter and/or enhancer that expression of the gene will be affected when the promoter 
and/or enhancer is contacted with the appropriate molecules. 

[0040] An "RNA coding region" is a nucleic acid that can serve as a template 
for the synthesis of an RNA molecule, such as an siRNA. Preferably, the RNA coding 
region is a DNA sequence. 

[0041] A "small interfering RNA" or "siRNA" is a double-stranded RNA 
molecule that is capable of inhibiting the expression of a gene with which it shares 
homology. In one embodiment the siRNA may be a "hairpin" or stem-loop RNA molecule, 
comprising a sense region, a loop region and an antisense region complementary to the 
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sense region. In other embodiments the siRNA comprises two distinct RNA molecules that 
are non-covalently associated to form a duplex. 

[0042] The term /'transgenic" is used herein to describe the property of 
harboring a transgene. For instance, a "transgenic organism" is any animal, including 
mammals, fish, birds and amphibians, in which one or more of the cells of the animal 
contain nucleic acid introduced by way of human intervention, such as by the methods 
described herein. In a transgenic animal that comprises a transgene that encodes a gene of 
interest, the transgene typically causes the cell to express or overexpress a recombinant 
protein. However, according to the methods of the invention, expression of an RNA coding 
region can be used to down regulate the expression of a particular gene through antisense or 
RNA interference mechanisms. 

[0043] The terms "founder," "founder animal" and "founder line" refer to those 
animals that are mature products of the embryos or oocytes to which the transgene was 
added, i.e. those animals that grew from the embryos or oocytes into which DNA was 
inserted. 

[0044] The terms "progeny" and "progeny of the transgenic animal" refer to any 
and all offspring of every generation subsequent to the originally transformed animal. 

[0045] The term "animal" is used in its broadest sense and refers to all animals 
including mammals, birds, fish, reptiles and amphibians. 

[0046] The term "mammal" refers to all members of the class Mammalia and 
includes any animal classified as a mammal, including humans, domestic and farm animals, 
and zoo, sports or pet animals, such as mouse, rabbit, pig, sheep, goat, cattle and higher 
primates. 

[0047] The term "oocyte" refers to a female gamete cell and includes primary 
oocytes, secondary oocytes and mature, unfertilized ovum. As used herein, the term "egg" 
when used in reference to a mammalian egg, means an oocyte surrounded by a zona 
pellucida. The term "zygote" refers to a fertilized ovum. The term "embryo" broadly refers 
to an animal in the early stages of development. 

[0048] 'Terivitelline space" refers to the space located between the zona 
pellucida and the cell membrane of a mammalian egg or embryonic cell. 

[0049] 'Target cell" or "host cell" means a cell that is to be transformed using 
the methods and compositions of the invention. 
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[0050] "Lentivirus" refers to a genus of retroviruses that are capable of infecting 
dividing and non-dividing cells. Several examples of lentiviruses include HIV (human 
immunodeficiency virus; including HIV type 1, and HIV type 2), the etiologic agent of the 
human acquired immunodeficiency syndrome (AIDS); visna-maedi, which causes 
encephalitis (visna) or pneumonia (maedi) in sheep, the caprine arthritis-encephalitis virus, 
which causes immune deficiency, arthritis, and encephalopathy in goats; equine infectious 
anemia virus, which causes autoimmune hemolytic anemia, and encephalopathy in horses; 
feline immunodeficiency virus (FIV), which causes immune deficiency in cats; bovine 
immune deficiency virus (BIV), which causes lymphadenopathy, lymphocytosis, and 
possibly central nervous system infection in cattle; and simian immunodeficiency virus 
(SIV), which cause immune deficiency and encephalopathy in sub-human primates. 

[0051] A lenti viral genome is generally organized into a 5' long terminal repeat 
(LTR), the gag gene, the pol gene, the env gene, the accessory genes (nef, vif, vpr, vpu) and 
a 3' LTR. The viral LTR is divided into three regions called U3, R and U5. The U3 region 
contains the enhancer and promoter elements. The U5 region contains the polyadenylation 
signals. The R (repeat) region separates the U3 and U5 regions and transcribed sequences 
of the R region appear at both the 5' and 3' ends of the viral RNA. See, for example, "RNA 
Viruses: A Practical Approach" (Alan J. Cann, Ed., Oxford University Press, (2000)), O 
Narayan and Clements J. Gen. Virology 70:1617-1639 (1989), Fields et al. Fundamental 
Virology Raven Press. (1990), Miyoshi H, Blomer U, Takahashi M, Gage FH, Verma IM. 
J Virol 72(10):8150-7 (1998), and U.S. Patent No. 6,013,516. 

[0052] Lenti viral vectors are known in the art, including several that have been 
used to transfect hematopoietic stem cells. Such vectors can be found, for example, in the 
following publications, which are incorporated herein by reference: Evans JT et al. Hum 
Gene Ther 1999;10:1479-1489; Case SS, Price MA, Jordan CT et al Proc Natl Acad Sci 
USA 1999;96:2988-2993; Uchida N, Sutton RE, Friera AM et al. Proc Natl Acad Sci USA 
1998;95:11939-1 1944; Miyoshi H, Smith KA, Mosier DE et al. Science 1999;283:682-686; 
Sutton RE, Wu HT, Rigg R et al. Human immunodeficiency virus type 1 vectors efficiently 
transduce human hematopoietic stem cells. J Virol 1998;72:5781-5788. 

[0053] "Virion," "viral particle" and "retroviral particle" are used herein to refer 
to a single virus comprising an RNA genome, pol gene derived proteins, gag gene derived 
proteins and a lipid bilayer displaying an envelope (glyco)protein. The RNA genome is 
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usually a recombinant RNA genome and thus may contain an RNA sequence that is 
exogenous to the native viral genome. The RNA genome may also comprise a defective 
endogenous viral sequence. 

[0054] A "pseudotyped" retrovirus is a retroviral particle having an envelope 
protein that is from a virus other than the virus from which the RNA genome is derived. 
The envelope protein may be from a different retrovirus or from a non-retroviral virus. A 
preferred envelope protein is the vesicular stomatitius virus G (VSV G) protein. However, 
to eliminate the possibility of human infection, viruses can alternatively be pseudotyped 
with ecotropic envelope protein that limit infection to a specific species, such as mice or 
birds. For example, in one embodiment, a mutant ecotropic envelope protein is used, such 
as the ecotropic envelope protein 4.17 (Powell et al. Nature Biotechnology 18(12):1279- 
1282 (2000)). 

[0055] The term "provirus" is used to refer to a duplex DNA sequence present 
in a eukaryotic chromosome that corresponds to the genome of an RNA retrovirus. The 
provirus may be transmitted from one cell generation to the next without causing lysis or 
destruction of the host cell. 

[0056] A "self-inactivating 3' LTR" is a 3* long terminal repeat (LTR) that 
contains a mutation, substitution or deletion that prevents the LTR sequences from driving 
expression of a downstream gene. A copy of the U3 region from the 3' LTR acts as a 
template for the generation of both LTR's in the integrated provirus. Thus, when the 3' 
LTR with an inactivating deletion or mutation integrates as the 5' LTR of the provirus, no 
transcription from the 5* LTR is possible. This eliminates competition between the viral 
enhancer/promoter and any internal enhancer/promoter. Self-inactivating 3' LTRs are 
described, for example, in Zufferey et al. J. Virol 72:9873-9880 (1998), Miyoshi et al. J. 
Virol 72:8150-8157 andlwakumaet al. Virology 261: 120-132 (1999). 

[0057] The term "RNA interference or silencing" is broadly defined and 
includes all posttranscriptional and transcriptional mechanisms of RNA mediated inhibition 
of gene expression, such as those described in (P.D. Zamore Science 296, 1265 (2002)). 

[0058] In one aspect of the invention, a recombinant retrovirus is used to deliver 
a transgene comprising an RNA coding region of interest to a target cell Preferably the 
target cell is a mammalian cell. The cell may be a primary cell, or may be a cultured cell, 
for example an without limitation an HEK, CHO, COS, MEF, 293 cell. In one embodiment 
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the target cell is an oocyte or an embryonic cell, more preferably a one-cell embryo. The 
RNA coding region and any associated genetic elements are thus integrated into the genome 
of the target cell as a provirus. When the target cell is an embryo, the cell may then be 
allowed to develop into a transgenic animal by methods well known in the art. 

[0059] The recombinant retrovirus used to deliver the RNA coding region is 
preferably a modified lentivirus, and thus is able to infect both dividing and non-dividing 
cells. The recombinant retrovirus preferably comprises a modified lentiviral genome that 
includes the transgene. Further, the modified lentiviral genome preferably lacks 
endogenous genes for proteins required for viral replication, thus preventing undesired 
replication, such as replication in a resulting transgenic animal. The required proteins are 
preferably provided in trans in the packaging cell line during production of the recombinant 
retrovirus, as described below. 

[0060] In another embodiment, the recombinant retrovirus used to deliver the 
RNA coding region is a modified Moloney virus, for example a Moloney Murine Leukemia 
Virus. In a further embodiment, the virus is a Murine Stem Cell Virus (Hawley, R. G., et 
al. (1996) Proc. Natl. Acad. Sci. USA 93:10297-10302; Keller, G., et al. (1998) Blood 
92:877-887; Hawley, R. G., et al. (1994) Gene Ther. 1:136-138). The recombinant 
retrovirus also can be a hybrid virus such as that described in Choi, JK; Hoanga, N; Vilardi, 
AM; Conrad, P; Emerson, SG; Gewirtz, AM. (2001) Hybrid HIV/MSCV LTR Enhances 
Transgene Expression of Lentiviral Vectors in Human CD34+ Hematopoietic Cells. Stem 
Ce/k 19, No. 3, 236-246. 

[00611 In the preferred embodiment the transgene is incorporated into a viral 
construct that comprises an intact retroviral 5' LTR and a self-inactivating 3' LTR. The 
viral construct is preferably introduced into a packaging cell line that packages viral 
genomic RNA based on the viral construct into viral particles with the desired host 
specificity. Viral particles are collected and used to infect the host cell. Each of these 
aspects is described in detail below. 

The Viral Construct 

[0062] The viral construct is a nucleotide sequence that comprises sequences 
necessary for the production of recombinant retrovirus in a packaging cell. In one 
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embodiment the viral construct additionally comprises genetic elements that allow for the 
desired expression of an RNA molecule or gene of interest in the host. 

[0063] Generation of the viral construct can be accomplished using any suitable 
genetic engineering techniques well known in the art, including, without limitation, the 
standard techniques of PCR, oligonucleotide synthesis, restriction endonuclease digestion, 
ligation, transformation, plasmid purification, and DNA sequencing, for example as 
described in Sambrook et al. (Molecular Cloning: A Laboratory Manual Cold Spring 
Harbor Laboratory Press, N.Y. (1989)), Coffin et al. (Retroviruses. Cold Spring Harbor 
Laboratory Press, N.Y. (1997)) and "RNA Viruses: A Practical Approach" (Alan J. Cann, 
Ed., Oxford University Press, (2000)). 

[0064] The viral construct may incorporate sequences from the genome of any 
known organism. The sequences may be incorporated in their native form or may be 
modified in any way. For example, the sequences may comprise insertions, deletions or 
substitutions. In the preferred embodiment the viral construct comprises sequences from a 
lentivirus genome, such as the HIV genome or the SIV genome. 

[0065] The viral construct preferably comprises sequences from the 5' and 3' 
LTRs of a lentivirus. More preferably the viral construct comprises the R and U5 
sequences from the 5' LTR of a lentivirus and an inactivated or self-inactivating 3' LTR 
from a lentivirus. The LTR sequences may be LTR sequences from any lentivirus from any 
species. For example, they may be LTR sequences from HIV, SIV, FIV or BIV. Preferably 
the LTR sequences are HIV LTR sequences. The virus also can incorporate sequences 
from MMV or MSCV. 

[0066] The viral construct preferably comprises an inactivated or self- 
inactivating 3' LTR. The 3' LTR may be made self-inactivating by any method known in 
the art. In the preferred embodiment the U3 element of the 3' LTR contains a deletion of 
its enhancer sequence, preferably the TATA box, Spl and NF-kappa B sites. As a result of 
the self-inactivating 3' LTR, the provirus that is integrated into the host cell genome will 
comprise an inactivated 5' LTR. 

[0067] Optionally, the U3 sequence from the lenti viral 5' LTR may be replaced 
with a promoter sequence in the viral construct. This may increase the titer of virus 
recovered from the packaging cell line. An enhancer sequence may also be included. Any 
enhancer/promoter combination that increases expression of the viral RNA genome in the 
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packaging cell line may be used. In the preferred embodiment the CMV enhancer/promoter 
sequence is used (U.S. Patent No. 5,168,062; Karasuyama et al J. Exp. Med. 169:13 (1989). 

[0068] The viral construct also comprises a transgene. The transgene, may be 
any nucleotide sequence, including sequences that serve as markers for the provirus. 
Preferably the transgene comprises one or more RNA coding regions and/or one or more 
genes of interest. Schematic diagrams of exemplary retroviral constructs are shown in 
Figures 1 A and IB. 

[0069] In the preferred embodiment the transgene comprises at least one RNA 
coding region. Preferably the RNA coding region is a DNA sequence that can serve as a 
template for the expression of a desired RNA molecule in the host cell. In one 
embodiment, the viral construct comprises two or more RNA coding regions. 

[0070] The viral construct also preferably comprises at least one RNA 
Polymerase III promoter. The RNA Polymerase IE promoter is operably linked to the RNA 
coding region and can also be linked to a termination sequence. In addition, more than one 
RNA Polymerase III promoter may be incorporated. 

[0071] RNA Polymerase ED promoters are well known to one of skill in the art. 
A suitable range of RNA Polymerase HI promoters can be found, for example, in Paule and 
White. Nucleic Acids Research., Vol 28, pp 1283-1298 (2000), which is hereby 
incorporated by reference in its entirety. The definition of RNA Polymerase III promoters 
also include any synthetic or engineered DNA fragment that can direct RNA Polymerase m 
to transcribe a downstream RNA coding sequence. Further, the RNA Polymerase HI (Pol 
III) promoter or promoters used as part of the viral vector can be inducible. Any suitable 
inducible Pol IE promoter can be used with the methods of the invention. Particularly 
suited Pol IE promoters include the tetracycline responsive promoters provided in Ohkawa 
and Taira Human Gene Therapy, Vol. 1 1, pp 577-585 (2000) and in Meissner et al Nucleic 
Acids Research, Vol. 29, pp 1672-1682 (2001), which are incorporated herein by reference. 

[0072] In one embodiment the transgene comprises a gene of interest that 
encodes a protein that is desirably expressed in one or more cells of a transgenic animal, for 
example, a reporter or marker protein. Preferably the gene of interest is located between the 
5' LTR and 3' LTR sequences. Further, the gene of interest is preferably in a functional 
relationship with other genetic elements, for example transcription regulatory sequences 
such as promoters and/or enhancers, to regulate expression of the gene of interest in a 
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particular manner once the transgene is incorporated into the host genome. In certain 
embodiments, the useful transcriptional regulatory sequences are those that are highly 
regulated with respect to activity, both temporally and spatially. 

[0073J Preferably the gene of interest is in a functional relationship with internal 
Polymerase II promoter/enhancer regulatory sequences. An "internal" promoter/enhancer is 
one that is located between the 5' LTR and the 3' LTR sequences in the viral construct and 
is operably linked to the gene that is desirably expressed. 

[0074] The Polymerase II promoter/enhancer may be any promoter, enhancer or 
promoter/enhancer combination known to increase expression of a gene with which it is in 
a functional relationship. 

[0075) The internal promoter/enhancer is preferably selected based on the 
desired expression pattern of the gene of interest and the specific properties of known 
promoters/enhancers. Thus, the internal promoter may be a constitutive promoter. Non- 
limiting examples of constitutive promoters that may be used include the promoter for 
ubiquitin, CMV (U.S. Patent No. 5,168,062; Karasuyama et al J. Exp. Med. 169:13 (1989), 
B-actin (Gunning et al. Proc. Natl. Acad. Sci. USA 84:4831-4835 (1987) and pgk (see, for 
example, U.S. Patent Nos. 4,615,974 and 5,104,795; Adra et al. Gene 60:65-74 (1987), 
Singer-Sam et al. Gene 32:409-417 (1984) and Dobson et al. Nucleic Acids Res. 10:2635- 
2637 (1982)). Alternatively, the promoter may be a tissue specific promoter. Several non- 
limiting examples of tissue specific promoters that may be used include lck (see, for 
example, Garvin et al. Mol. Cell Biol. 8:3058-3064 (1988) and Takadera et al. Mol. Cell 
Biol. 9:2173-2180 (1989)), myogenin (Yee et al. Genes and Development 7:1277-1289 
(1993), and thyl (Gundersen et al. Gene 113:207-214 (1992). In addition, promoters may 
be selected to allow for inducible expression of the transgene. A number of systems for 
inducible expression using such a promoter are known in the art, including the tetracycline 
responsive system and the lac operator-repressor system. It is also contemplated that a 
combination of promoters may be used to obtain the desired expression of the gene of 
interest. The skilled artisan will be able to select a promoter based on the desired 
expression pattern of the gene in the resulting transgenic animal. 

[0076] An internal enhancer may also be present in the viral construct to 
increase expression of the gene of interest. For example the CMV enhancer (Karasuyama 
et al J. Exp. Med. 169:13 (1989) may be used in combination with the chicken B-actin 
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promoter (see, e.g., JP 1990005890-A1). Again, one of skill in the art will be able to select 
the appropriate enhancer based on the desired expression pattern. 

[0077] The gene of interest is not limited in any way and includes any gene that 
the skilled practitioner desires to have integrated and/or expressed in a transgenic animal. 
For example, the gene of interest may be one that encodes a protein that serves as a marker 
to identify cells comprising the provirus. In other embodiments the gene of interest 
encodes a protein that modifies a physical characteristic of the transgenic animal, such as a 
protein that modifies size, growth, or tissue composition. In another example the gene of 
interest may encode a protein of commercial value that may be harvested from the 
transgenic animal. 

10078] In addition, more than one gene of interest may be placed in functional 
relationship with the internal promoter. For example a gene encoding a marker protein may 
be placed after the primary gene of interest to allow for identification of cells that are 
expressing the desired protein. In one embodiment a fluorescent marker protein, preferably 
green fluorescent protein (GFP), is incorporated into the construct along with the gene of 
interest. If a second reporter gene is included, an internal ribosomal entry site (IRES) 
sequence is also preferably included (U.S. Patent No. 4,937,190). The IRES sequence may 
facilitate the expression of the reporter gene. 

[0079] The viral construct may also contain additional genetic elements. The 
types of elements that may be included in the construct are not limited in any way and will 
be chosen by the skilled practitioner to achieve a particular result. For example, a signal 
that facilitates nuclear entry of the viral genome in the target cell may be included. An 
example of such a signal is the HTV-1 flap signal. 

[0080] Further, elements may be included that facilitate the characterization of 
the provirus integration site in the genome of the animal. For example, a tRNA amber 
suppressor sequence may be included in the construct. 

[0081] In addition, the construct may contain one or more genetic elements 
designed to enhance expression of the gene of interest. For example, a woodchuck hepatitis 
virus responsive element (WRE) may be placed into the construct (Zufferey et al. J. Virol 
74:3668-3681 (1999); Deglon et al. Hum. Gene Ther. 1 1:179-190 (2000)). 

[0082] A chicken P-globin insulator (Chung et al. Proc. Natl Acad. Sci USA 
94:575-580 (1997)) may also be included in the viral construct. This element has been 
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shown to reduce the chance of silencing the integrated provirus in the transgenic animal due 
to methylation and heterochromatinization effects. In addition, the insulator may shield the 
internal enhancer, promoter and exogenous gene from positive or negative positional 
effects from surrounding DNA at the integration site on the chromosome. 

[0083] Any additional genetic elements are preferably inserted 3' of the gene of 
interest. 

[0084] In a specific embodiment, the viral vector comprises: a cytomegalovirus 
(CMV) enhancer/promoter sequence; the R and U5 sequences from the HIV 5' LTR; the 
HTV-1 flap signal; an internal enhancer; an internal promoter; a gene of interest; the 
woodchuck hepatitis virus responsive element; a tRNA amber suppressor sequence; a U3 
element with a deletion of its enhancer sequence; the chicken p-globin insulator; and the R 
and U5 sequences of the 3 ' HTV LTR. 

[0085] The viral construct is preferably cloned into a plasmid that may be 
transfected into a packaging cell line. The preferred plasmid preferably comprises 
sequences useful for replication of the plasmid in bacteria. 

Production of Virus 

[0086] Any method known in the art may be used to produce infectious 
retroviral particles whose genome comprises an RNA copy of the viral construct described 
above. 

[0087] Preferably, the viral construct is introduced into a packaging cell line. 
The packaging cell line provides the viral proteins that are required in trans for the 
packaging of the viral genomic RNA into viral particles. The packaging cell line may be 
any cell line that is capable of expressing retroviral proteins. Preferred packaging cell lines 
include 293 (ATCC CCL X), HeLa (ATCC CCL 2), D17 (ATCC CCL 183), MDCK 
(ATCC CCL 34), BHK (ATCC CCL-10) and Cf2Th (ATCC CRL 1430). The most 
preferable cell line is the 293 cell line. 

[0088] The packaging cell line may stably express the necessary viral proteins. 
Such a packaging cell line is described, for example, in U.S. Patent No. 6,218,181. 
Alternatively a packaging cell line may be transiently transfected with plasmids comprising 
nucleic acid that encodes the necessary viral proteins. 
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[0089] In one embodiment a packaging cell line that stably expresses the viral 
proteins required for packaging the RNA genome is transfected with a plasmid comprising 
the viral construct described above. 

[0090] In another embodiment a packaging cell line that does not stably express 
the necessary viral proteins is co-transfected with two or more plasmids essentially as 
described in Yee et al. (Methods Cell. Biol. 43A, 99-112 (1994)). One of the plasmids 
comprises the viral construct comprising the transgene. The other plasmid(s) comprises 
nucleic acid encoding the proteins necessary to allow the cells to produce functional virus 
that is able to infect the desired host cell. 

[0091] The packaging cell line may not express envelope gene products. In this 
case the packaging cell line will package the viral genome into particles that lack an 
envelope protein. As the envelope protein is responsible, in part, for the host range of the 
viral particles, the viruses are preferably pseudotyped. Thus the packaging cell line is 
preferably transfected with a plasmid comprising sequences encoding a membrane- 
associated protein that will permit entry of the virus into a host cell. One of skill in the art 
will be able to choose the appropriate pseudotype for the host cell that is to be used. For 
example, in one embodiment the viruses are pseudotyped with the vesicular stomatitis virus 
envelope glycoprotein (VSVg). In addition to conferring a specific host range this 
pseudotype may permit the virus to be concentrated to a very high titer. Viruses can 
alternatively be pseudotyped with ecotropic envelope proteins that limit infection to a 
specific species, such as mice or birds. For example, in another embodiment, a mutant 
ecotropic envelope protein is used, such as the ecotropic envelope protein 4.17 (Powell et 
al. Nature Biotechnology 18(12):1279-1282 (2000)). 

[0092] In the preferred embodiment a packaging cell line that does not stably 
express viral proteins is transfected with the viral construct, a second vector comprising the 
HIV-1 packaging vector with the env, nef, 5'LTR, 3'LTR and vpu sequences deleted, and a 
third vector encoding an envelope glycoprotein. Preferably the third vector encodes the 
VSVg envelope glycoprotein. 

[0093] In another embodiment of invention, RNA interference activity of the 
packaging cells is suppressed to improve the production of recombinant virus. This 
includes, without limitation, the use of cotransfection or stable transfection of constructs 
expressing siRNA molecules to inhibit Dicer, an RNase III family member of ribonuclease 
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which is essential for RNA interference (Hammond et al. Nat. Rev. Genet. 2:110-119 

[0094] The recombinant virus is then preferably purified from the packaging 
cells, titered and diluted to the desired concentration. 

Transgenic Animals 

[0095] In order to make transgenic animals, an oocyte or one or more embryonic 
cells are infected with the recombinant virus produced as described above. One of skill in 
the art will recognize that the method of infection and the treatment of the cell following 
infection will depend upon the type of animal from which the cell is obtained. For 
example, mammalian cells are preferably implanted in a pseudopregnant female following 
infection while for the generation of transgenic birds or fish, the virus is preferably 
delivered to a laid egg and thus implantation is not required. 

[0096] While early methods of making transgenic animals required the cells to 
be rapidly dividing, there is no such requirement in the methods of the present invention. 
Thus the cell may be contacted at any point in development. In the preferred embodiment, 
a zygote is contacted with the recombinant virus. 

[0097] The cells to be infected with the virus may be obtained by any method 
known in the art and appropriate for the specific species in which it is desired to make a 
transgenic animal. For example, the recovery of fertilized mouse oocytes is described in 
Hogan et al. (Manipulating the Mouse Embryo: A Laboratory Manual. 2 nd ed. Cold 
Spring Harbor Laboratory Press, NY (1994)). A method for obtaining fertilized rat oocytes 
is described in Armstrong et al. (Biol. Reprod. 39,511-518 (1998)). 

[0098] It is not necessary that the cells be contacted after fertilization. In one 
embodiment, the virus is delivered to unfertilized ova. Development may then be 
initialized, for example by in vitro fertilization. 

Delivery of the Virus 

[0099] The virus may be delivered to the cell in any way that allows the virus to 
infect the cell. Preferably the virus is allowed to contact the cell membrane. Two preferred 
methods of delivering the virus to mammalian cells, injection and direct contact, are 
described below. 
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Injection 

[0100] In a first embodiment the virus is injected into the perivitelline space 
between the zona pellucida and the cell membrane of a single-cell zygote. Preferably less 
than 50 picoliters of viral suspension is injected, more preferably less than 25 picoliters and 
even more preferably about 10 picoliters. 

[0101] The virus is preferably present in a viral suspension and may be injected 
by any method known in the art. The viral suspension is preferably injected through a 
hydraulic injector. More preferably a glass micropipette is used to inject the virus. In one 
embodiment a micropipette is prepared by pulling borosilicate glass capillary on a pipette 
puller. The tip is preferably opened and beveled to approximately 10 urn. The lentiviral 
suspension may be loaded into the micropipette from the tip using gentle negative pressure. 

[0102] In one embodiment the cell is stabilized with a holding pipette mounted 
on a micromanipulator, such as by gentle negative pressure against a fire-polished pipette, 
and a second micromanipulator is used to direct the tip of a micropipette into the space 
between the zona pellucida and the cell membrane, where the virus is injected. 

Direct contact 

[0103] In another embodiment the zona pellucida is removed from the cell to 
produce a denuded embryo and the cell membrane is contacted with the virus. The zona 
pellucida may be removed by any method known in the art. Preferably it is removed by 
enzymatic treatment. For example, treatment with pronase may be used to remove the zona 
pellucida while the cell membrane is kept intact. Alternatively, the cell may be placed in 
media at pH at which the zona pellucida dissolves while the cell membrane remains intact. 
For example the cell may be incubated in an acidic Tyrode's solution at room temperature 
for several minutes. Once the zona pellucida is removed, any method that allows for the 
virus to contact the cell membrane may be used. Preferably, the cell is incubated in a 
solution containing the virus. Even more preferably, the solution is media that facilitates 
survival of the cell. 

[0104] In this embodiment, the cells are preferably contacted with the virus in 
culture plates. The virus may be suspended in media and added to the wells of a multi-well 
culture plate. The cells may then be plated in the individual wells. The media containing 
the virus may be added prior to the plating of the cells or after the cells have been plated. 
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Preferably individual cells are incubated in approximately 10 ^1 of media. However, any 
amount of media may be used as long as an appropriate concentration of virus in the media 
is maintained such that infection of the host cell occurs. 

[0105] The cells are preferably incubated with the virus for a sufficient amount 
of time to allow the virus to infect the cells. Preferably the cells are incubated with virus 
for at least 1 hour, more preferably at least 5 hours and even more preferably at least 10 
hours. 

[0106] Both the injection and direct contact embodiments may advantageously 
be scaled up to allow high throughput transgenesis. Because of the relative simplicity of 
the injection technique, it is possible to inject many embryos rapidly. For example, it is 
possible to inject more than 200 fertilized oocytes in less than one hour. With regard to the 
direct contact embodiment, any number of embryos may be incubated in the viral 
suspension simultaneously. This may be accomplished, for example, by plating the desired 
number of single-cell zygotes in multi-well tissue culture plates containing the virus 
suspended in media appropriate for the survival and growth of the cells. 

[0107] In both embodiments, any concentration of virus that is sufficient to 
infect the cell may be used. Preferably the concentration is at least 1 pfu/^1, more 
preferably at least 10 pfu/jil, even more preferably at least 400 pfu/^il and even more 
preferably at least 1 x 10 4 pfu/^il. 

[0108] Following infection with the virus, the cells are preferably implanted in 
an animal. More preferably cells infected with the virus are implanted in pseudo-pregnant 
animals of the same species from which the infected cells were obtained. Methods of 
creating pseudo-pregnancy in animals and implanting embryos are well known in the art 
and are described, for example, in Hogan et al. (Manipulating the Mouse Embryo: A 
Laboratory Manual. 2 nd ed. Cold Spring Harbor Laboratory Press, NY (1994)). 

[0109] In the preferred embodiment early stage embryos (approximately 0 - 2.5 
days px.) still with an intact zona pellucida are transferred to the oviduct of timed 
pseudopregnant female (preferably 0.5 days p.c.), while embryos that have reached the 
blastocyst stage are transferred to the uterus of timed pseudopregnant females (preferably 
2.5 days p.c). Denuded embryos are preferably cultured in vitro until they reach the morula 
or blastocyst stage (48 to 72 hours in culture), and are then implanted into appropriately 
timed pseudopregnant females. 
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[0110] The embryos and resulting animals may be analyzed, for example for 
integration of the transgene, the number of copies of the transgene that integrated, the 
location of the integration, the ability to transmit the transgene to progeny and expression of 
the transgene. Such analysis may be carried out at any time and may be carried out by any 
methods known in the art. Standard techniques are described, for example, in Hogan et al. 
(supra). 

[0111] The methods of infecting cells disclosed above do not depend upon 
species-specific characteristics of the cells. As a result, they are readily extended to all 
mammalian species. 

[0112] Initial experiments with mice indicate that of those animals that develop 
to full term, 80-90% carried at least one copy of the transgene and that, of these, 
approximately 85% express the gene of interest. Of the transgenic animals about 25% carry 
only 1 or 2 copies of the transgene. The highest number of pro viral insertions observed was 
about 30. Of the animals that carried only 1 or 2 copies of the transgene, about 80% 
expressed the gene of interest. 

[0113] As discussed above, the modified retrovirus can be pseudotyped to 
confer upon it a broad host range. One of skill in the art would also be aware of appropriate 
internal promoters to achieve the desired expression of a gene of interest in a particular 
animal species. Thus, one of skill in the art will be able to modify the method of infecting 
cells to create transgenic animals of any species. 

[01 14] In one embodiment, transgenic birds are created by delivering a modified 
retrovirus, as described above, to the primordial germ cells of early stage avian embryos. 
Freshly laid eggs are obtained and placed in a temperature controlled, humidified incubator. 
Preferably, the embryonic blastodisc in the egg is gradually rotated to lie on top of the yolk. 
This may be accomplished by any method known in the art, such as by gently rocking the 
egg regularly, preferably every 15 minutes. Approximately 36 hours later, the modified 
retrovirus is delivered into the space between the embryonic disk and the perivitelline 
membrane. Preferably about 50 nL of viral solution is delivered, more preferably about 100 
nL of viral solution is delivered, and even more preferably about 200 nL of viral solution is 
delivered. The viral solution may be delivered by any method known in the art for 
delivering compositions to the inside of an egg. In the preferred embodiment a window is 
opened in the shell, the viral solution is injected through the window and the shell window 
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is closed. The eggs are preferably incubated until hatching. The eggs will hatch after 
approximately 20 days, depending upon the particular avian species from which they are 
obtained. Hatched chicks are preferably raised to sexual maturity and mated. The 
transgenic offspring of the founder animals may be identified by any method known in the 
art, such as Southern blot, PCR and expression analysis. 

[0115] In another embodiment, transgenic fish are created by delivering the 
modified retrovirus, described above, to single-cell fish embryos. Fertilized fish eggs are 
collected by any method known in the art. The modified retrovirus is then preferably 
delivered to the space between the chorion and the cell membrane. This may be 
accomplished, for example, by loading the modified retrovirus in solution into a glass 
pipette. The pipette may then be used to pierce the chorion membrane and deliver the viral 
suspension. Preferably about 50 nL of viral solution is delivered, more preferably about 
100 nL of viral solution is delivered, and even more preferably about 200 nL of viral 
solution is delivered. Injected embryos are preferably returned to a temperature-controlled 
water tank and allowed to mature. At sexual maturity the founder fish are preferably mated 
and their progeny analyzed for the presence of the transgene by any method known in the 
art. 

[0116] As mentioned above, the methods of the present invention will also 
prove useful in techniques for identifying genes that are involved in specific biological 
processes, such as gene trap assays and large-scale mutagenesis screens. Such methods are 
described in the copending provisional patent applications 60/322,031 filed on 9/13/2001 
and copending U.S. provisional patent application 60/347,782 filed on 1/9/2002. 

Down-regulating Gene Expression in a Target Cell 

[01 1 7] The methods described herein allow the expression of RNA molecules in 
cells, and are particularly suited to the expression of small RNA molecules, which can not 
be readily expressed from a Pol II promoter. According to a preferred embodiment of the 
invention, an RNA molecule is expressed within a cell in order to down-regulate the 
expression of a target gene. The ability to down-regulate a target gene has many 
therapeutic and research applications, including identifying the biological functions of 
particular genes. Using the techniques and compositions of the invention, it will be 
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possible to knock-down (or down-regulate) the expression of a large number of genes, both 
in cell culture and in mammalian organisms. 

[0118] In preferred embodiments of the invention, an RNA expression cassette 
comprises a Pol HI promoter and an RNA coding region. The RNA coding region 
preferably encodes an RNA molecule that is capable of down-regulating the expression of a 
particular gene or genes. The RNA molecule encoded can, for example, be complementary 
to the sequence of an RNA molecule encoding a gene to be down-regulated. In such an 
embodiment, the RNA molecule preferably acts through an antisense mechanism. 

[0119] A more preferred embodiment involves the expression of a double- 
stranded RNA complex, or an RNA molecule having a stem-loop or a so-called "hairpin" 
structure. As used herein, the term "RNA duplex" refers to the double stranded regions of 
both the RNA complex and the double-stranded region of the hairpin or stem-lop structure. 

[0120] Double stranded RNA has been shown to inhibit gene expression of 
genes having a complementary sequence through a process termed RNA interference or 
suppression (see, for example, Hammond et al. Nat. Rev; Genet. 2:1 10-1 19 (2001)). 

[0121] According to the invention, an RNA duplex or siRNA corresponding to 
a region of a gene to be down-regulated is expressed in the cell. The RNA duplex is 
substantially identical (typically at least about 80% identical, more preferably at least about 
90% identical) in sequence to the sequence of the gene targeted for down regulation. 
siRNA duplexes are described, for example, in Bummelkamp et al. Science 296:550-553 
(2202), Caplen et al. Proc. Natl. Acad. Sci. USA 98:9742-9747 (2001) and Paddison et al 
Genes & Devel. 16:948-958 (2002). 

[0122] The RNA duplex is generally at least about 15 nucleotides in length and 
is preferably about 15 to about 30 nucleotides in length. However, a significantly longer 
RNA duplex can be used effectively in some organisms. In a more preferred embodiment, 
the RNA duplex is between about 19 and 22 nucleotides in length. The RNA duplex is 
preferably identical to the target nucleotide sequence over this region. 

[0123] When the gene to be down regulated is in a family of highly conserved 
genes, the sequence of the duplex region can be chosen with the aid of sequence 
comparison to target only the desired gene. On the other hand, if there is sufficient identity 
among a family of homologous genes within an organism, a duplex region can be designed 
that would down regulate a plurality of genes simultaneously. 
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[0124] The duplex RNA can be expressed in a cell from a single retroviral 
construct. In the preferred embodiment, a single RNA coding region in the construct is a 
serves as a template for the expression of a self-complementary hairpin RNA, comprising a 
sense region, a loop region and an antisense region. This embodiment is illustrated in 
Figure 2, which shows a schematic view of an RNA expression cassette having an RNA Pol 
III promoter 100 operatively linked to an RNA coding region, having a sense region 110, a 
loop region 120, an antisense region 130 and a terminator region 140. The sense 110 and 
antisense 130 regions are each preferably about 15 to about 30 nucleotides in length. The 
loop region 120 preferably is about 2 to about 15 nucleotides in length, more preferably 
from about 4 to about 9 nucleotides in length. Following expression the sense and 
antisense regions form a duplex. 

[0125] In another embodiment, the retroviral construct comprises two RNA 
coding regions. The first coding region is a template for the expression of a first RNA and 
the second coding region is a template for the expression of a second RNA. Following 
expression, the first and second RNA's form a duplex. The retroviral construct preferably 
also comprises a first Pol HI promoter operably linked to the first RNA coding region and a 
second Pol HI promoter operably linked to the second RNA coding region. This 
embodiment is illustrated in Figure 3, which shows a schematic view of an RNA expression 
cassette having an RNA Polymerase m promoter 100 linked to a first RNA coding region 
110 and a first terminator sequence 140 and a second RNA polymerase III promoter 105 
linked to a second RNA coding region 1 1 5 and a second terminator 145. 

[0126] In yet another embodiment of the invention, the retroviral construct 
comprises a first RNA Pol EQ promoter operably linked to a first RNA coding region, and a 
second RNA Pol m promoter operably linked to the same first RNA coding region in the 
opposite direction, such that expression of the RNA coding region from the first RNA Pol 
m promoter results in a synthesis of a first RNA molecule as the sense strand and 
expression of the RNA coding region from the second RNA Pol in promoter results in 
synthesis of a second RNA molecule as an antisense strand that is substantially 
complementary to the first RNA molecule. In one such embodiment, both RNA 
Polymerase HI promoters are separated from the RNA coding region by termination 
sequences, preferably termination sequences having five consecutive T residues. Figure 4 
shows a schematic view of an RNA expression cassette having a first RNA Polymerase IE 
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promoter 100 linked to an RNA coding region 110 and a first terminator sequence 145. 
The expression cassette has a second RNA polymerase HI promoter 105 linked to the RNA 
coding region 115, the same sequence as 110 in reverse, and a second terminator 140. 

[0127] In further embodiments an RNA duplex is expressed using two or more 
retroviral constructs. In one embodiment, a first retroviral construct is used that directs the 
expression of a first RNA and a second retroviral construct is used that directs expression 
of a second RNA that is complementary to the first. Following expression the first and 
second RNAs form a duplex region. It is preferred, however, that the entire duplex region 
is introduced using retroviral particles derived from a single retroviral construct. As 
discussed above, several strategies for expressing a duplex RNA from a single viral 
construct are shown in Figures 2-4. 

[0128] The RNA duplexes may be flanked by single stranded regions on one or 
both sides of the duplex. For example, in the case of the hairpin, the single stranded loop 
region would connect the duplex region at one end. 

[0129] The RNA coding region is generally operatively linked to a terminator 
sequence. The pol HI terminators preferably comprise of stretches of 4 or more thymidine 
("T") residues. In a preferred embodiment, a cluster of 5 consecutive Ts is linked 
immediately downstream of the RNA coding region to serve as the terminator. In such a 
construct pol III transcription is terminated at the second or third T of the DNA template, 
and thus only 2 to 3 uridine ("IT*) residues are added to the 3' end of the coding sequence. 

[0130] The sequence of the RNA coding region, and thus the sequence of the 
RNA duplex, preferably is chosen to be complementary to the sequence of a gene whose 
expression is to be downregulated in a cell or organism. The degree of down regulation 
achieved with a given RNA duplex sequence for a given target gene will vary by sequence. 
One of skill in the art will be able to readily identify an effective sequence. For example, in 
order to maximize the amount of suppression in a transgenic animal, a number of sequences 
can be tested for their efficacy in cell culture prior to generating a transgenic animal. 

[0131] The methods of the present invention will find great commercial 
application, for example in biotechnology, medicine and agriculture. For example, in 
agriculture the described methods may be used to confer disease resistance by expressing in 
a cell or organism an siRNA that specifically down-regulates the expression of a gene 
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associated with a pathogen or disease state. In biotechnology, the ability to rapidly develop 
large numbers of transgenic animals with desired modulation of specific genes will allow 
for the analysis of gene function and the evaluation of compounds that potentially modulate 
gene expression, protein function, and are useful in treating a disease or disorder. In 
particular, by observing the effect of down-regulating specific genes in transgenic animals, 
the biological function of those genes may be determined. In medicine the methods of the 
invention may be used to treat patients suffering from particular diseases or disorders, such 
as HIV, or to confer immunity or resistance to particular pathogens. For example, specific 
cells may be infected in vivo or ex vivo with recombinant retrovirus encoding an siRNA 
that down-regulates the activity of a gene whose activity is associated with a particular 
disease or disorder. 

[0132] The following examples are offered for illustrative puiposes only, and 
are not intended to limit the scope of the present invention in any way. Indeed, various 
modifications of the invention in addition to those shown and described herein will become 
apparent to those skilled in the art from the foregoing description and fall within the scope 
of the appended claims. 

[0133] All patent and literature references cited in the present specification are 
hereby incorporated by reference in their entirety. 

Examples 

Example 1 

[0134] An lentiviral construct was constructed by insertion of an siRNA 
expression cassette into the Pad site of HC-FUGW vector (Figure 5; SEQ ID NO: 2). The 
siRNA was designed to down-regulate expression of the lacZ gene. The HC-FUGW vector 
comprised a GFP marker gene operably linked to the human Ubiquitin promoter. The GFP 
marker was useful for tracking transduction events. The vector also comprised an HIV 
DNA Flap element to improve the virus titers, and the WRE for high level expression of 
viral genes. The siRNA expression cassette was composed of a pol HI promoter and a 
small hairpin RNA coding region followed by a pol HI terminator site. The pol HI promoter 
(SEQ ID NO:3) was derived from the -240 to -9 region of human HI -RNA promoter and 
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was cloned as an Eco Rl fragment by PCR amplification from HEK293 genomic DNA. 
The pol DI promoter was connected to the downstream RNA coding region by a 7 base pair 
linker sequence to ensure that the transcription was precisely initiated at the first nucleotide 
of the RNA coding sequence. The small hairpin RNA coding region comprised a 19 nt 
sequence corresponding to the 1900-1918 region of the sense strand of the bacterial beta- 
galactosidase (lacZ) gene coding sequence and the 19 nt perfect reverse complementary 
sequence separated by a 9 nt loop region. The terminator was comprised of 5 consecutive 
thymidine residues linked immediately downstream of the RNA coding sequence. The 
sequence of the hairpin siRNA is shown in SEQ ID NO: 1 . 

Example 2 

[0135] Transduction of cultured mammalian cells with retrovirus derived from 
the retroviral construct described in Example 1 was achieved (Figure 6). The retroviral 
vector encoding a small hairpin RNA molecule described in Example 1, was used to 
transfect cultured mammalian cells that express lacZ. A profound decrease in the 
expression of the lacZ was observed. 

[0136] The lacZ siRNA virus was produced by cotransfection of the retroviral 
vector, a helper virus plasmid and VSVg expression plasmid in HEK293 cells. The virus 
particles were harvested from the cell culture supernatants and concentrated by 
ultracentrifugation. The concentrated virus preparations were used to infect either mouse 
embryonic fibroblasts (MEF) or HEK293 cells which harbor both lacZ and firefly luciferase 
(Luc) reporter genes. Infection was monitored by the GFP signal which is expressed from 
the marker gene cassette of the viral vector. Under the conditions of this experiment, >98% 
of the test cells were GPF+ and thus were successfully transduced. The expression levels 
of lacZ and Luc reporter genes were measured by chemiluminescent assays using 
commercially available kits (lacZ assay kit from Roche and Luc from Promega). The lacZ 
siRNA virus only inhibited the expression of lacZ but not Luc. The specific inhibition was 
determined by the ratio of lacZ activity to Luc activity. The lacZ/Luc ratio of the 
uninfected parental cells was arbitrarily set to 1 and the values of the infected cells were 
calculated accordingly. As shown in Figure 6, transfection with the virus resulted in 
dramatic reduction in the amount of expression of the lacZ gene in both MEK and HEK293 
cells. 
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[0137] A tet-iducible lacZ siRNA lentiviral vector was also prepared as 
illustrated in figure 8. A Tet repressor gene (TetR; SEQ ID NO: 5) was placed the under 
the control of the human UbiquitinC promoter so that its expression could be monitored by 
the downstream GFP marker. The anti-lacZ siRNA cassette was driven by a Tet-inducible 
pol HI promoter derived from human U6-promoter (-328 to +1) containing a single TetR 
binding site (TetOl) between the PSE and TATA box (SEQ ID NO: 4). The TetR coding 
sequence was PCR amplified from genomic DNA from the TOP 10 strain of E. coli adn 
cloned into a modified version of FUIGW as a Bgl2-EcoRl fragment. In the absence of 
tetracycline, TetR binds to the promoter and its expression is repressed. Upon the addition 
of tetracycline, TetR is moved from the promoter and transcription starts. 

[0138] The Tet-inducible siRNA expression cassette was able to regulate gene 
expression in response to Doxycycline treatment. Virus was prepared from the retroviral 
construct carrying the Tet-inducible lacZ-siRNA cassette and a Tet repressor under the 
control of a UbiquitinC promoter and used to transduce HEK293 cells expressing both lacZ 
and luciferase (293Z+Luc). The transduced cells were treated with 10 ug/ml Doxycycline 
(Plus Dox) for 48hr or without the Doxycycline treatment as a control (No Dox). LacZ and 
luciferase activities were measured as described in the previous figures. The relative 
suppression activity is calculated as the ratio of lacZ versus luciferase and No Dox control 
was arbitrarily set to 1. As can be seen in Figure 9, in the presence of doxycycline 
suppression of lacZ activity was significantly enhanced. 

Example 3 

[0139] This example demonstrates the generation of transgenic animals that 
express an siRNA molecule encoded by a lentiviral vector. The expression of the lacZ 
specific siRNA described in Example 1 resulted in extensive suppression of lacZ activity in 
ROSA26+/- mice. 

[0140] ROSA26+/- mice carry one copy of a ubiquitously expressed lacZ 
reporter gene. The lacZ siRNA virus preparations described in Example 2 were used for 
perivitelline injection of ROSA26+/- single cell embryos obtained from hormone primed 
C57B1/6 female donors x ROSA26+/+ stud males. The injected single cell embryos were 
subsequently transferred into the oviduct of timed pseudopregnant female recipients. 
Embryonic day 15.5 to 17.5 (E15.5-17.5) fetuses were recovered from the surrogate 
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mothers. Successful transgenesis was scored by positive GFP signal observed with the 
fetuses under fluorescent microscope. Protein extracts prepared from the limb tissues of the 
fetuses were used for the LacZ chemiluminescent assay according to the manufacturer's 
instruction (Roche), and protein concentrations of the tissue extracts were determined by 
the Bradford assay (BioRad). The lacZ expression levels were expressed as light units (LU) 
per ug of proteins (LU/ug). The E15.5-17.5 fetuses from the timed mating of C57B1/6 
females x ROSA26+/+ males and C57B1/6 females x C57B1/6 males were served as the 
positive and negative controls respectively. The results are shown in Figure 7. Animals 
G1-G4 (those treated derived from embroys infected with the virus comprising the siRNA 
construct) showed markedly decreased expression of the lacZ gene as compared with 
untreated control animals. 
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WHAT IS CLAIMED IS: 

1. A method of expressing an RNA molecule within a cell, the method 
comprising: 

transfecting a packaging cell line with a retroviral construct; 
recovering recombinant retrovirus from the packaging cell line; and 
infecting a target cell with the recombinant retrovirus, 

wherein the retroviral construct comprises the R and U5 sequences from a 5' 
lentiviral long terminal repeat (LTR), a self-inactivating lentiviral 3' LTR, an RNA 
Polymerase HI promoter region and an RNA coding region operably linked to an RNA 
Polymerase III promoter region. 

2. The method of claim 1, wherein the retroviral construct further comprises at 
least one termination sequence operably linked to the RNA coding region. 

3. The method of claim 1, wherein the RNA Polymerase HI promoter is 
inducible. 

4. The method of claim 3, wherein the inducible promoter is activated with 
tetracycline. 

5. The method of claim 1, wherein the RNA coding region encodes a self- 
complementary RNA molecule having a sense region, an antisense region and a loop 
region. 

6. The method of claim 5, wherein the loop region is about 2 to about 10 
nucleotides in length. 

7. The method of claim 5, wherein the sense region and the antisense region 
are each between about 15 and about 30 nucleotides in length. 

8. The method of claim 1, wherein the retroviral construct comprises a first 
RNA coding region operably linked to a first RNA Polymerase HI promoter and a second 
RNA coding region operably linked to a second RNA Polymerase m promoter 

9. The method of claim 8, wherein the first RNA coding region encodes a first 
RNA molecule and the second RNA coding region encodes a second RNA molecule. 

10. The method of claim 9, wherein the first RNA molecule and the second 
RNA molecule are substantially complementary. 

11. The method of claim 1, wherein the retroviral construct comprises a first 
RNA Polymerase DH promoter and a second RNA Polymerase HI promoter, each operably 
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linked to the RNA coding region, such that expression of the RNA coding region from the 
first RNA Polymerase III promoter results in the synthesis of a first RNA molecule and 
expression of the RNA coding region from the second RNA Polymerase HI promoter 
results in the synthesis of a second RNA molecule substantially complementary to the first 
RNA molecule. 

12. The method of claim 1, wherein expression of the RNA coding region 
results in the down regulation of a target gene, wherein the target gene comprises a 
sequence that is at least about 90% identical with the RNA coding region. 

1 3. The method of claim 1 wherein said packaging cell line is a 293 cell line. 

14. The method of claim 1 wherein the 5' LTR sequences are from HTV. 

15. The method of claim 1, wherein the viral construct comprises the 
woodchuck hepatitis virus enhancer element sequence. 

16. The method of claim 1, wherein the viral construct comprises a tRNA amber 
suppressor sequence. 

17. The method of claim 1 wherein the self-inactivating 3' LTR comprises a U3 
element with a deletion of its enhancer sequence, 

18. The method of claim 17, wherein the self-inactivating 3* LTR is a modified 
HIV 3' LTR. 

19. The method of claim 1 , wherein the recombinant retrovirus is pseudotyped. 

20. The method of claim 19, wherein the recombinant retrovirus is pseudotyped 
with the vesicular stomatitits virus envelope glycoprotein. 

21. The method of claim 1, wherein the viral construct further comprises a gene 
of interest. 

22. The method of claim 21, wherein the viral construct has a Polymerase II 
promoter operably linked to the gene of interest. 

23. The method of claim 22, wherein the promoter is a CMV promoter. 

24. The method of claim 22, wherein the viral construct additionally comprises 
an enhancer operably linked to the promoter. 

25. The method of claim 25, wherein the enhancer and promoter are CMV 
sequences. 

26. The method of claim 2 1 , wherein the gene of interest is a reporter gene. 
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27. The method of claim 26, wherein the reporter gene encodes a fluorescent 

protein. 

28. The method of claim 27, wherein the fluorescent protein is green fluorescent 

protein. 

29. The method of claim 20, wherein the polymerase II promoter is a ubiquitous 
promoter. 

30. The method of claim 27, wherein the ubiquitous promoter is selected from 
the group consisting of the ubiquitin promoter, the CMV (3-actin promoter and the pgk 
promoter. 

31. The method of claim 22, wherein the RNA Polymerase II promoter is a 
tissue specific promoter. 

32. The method of claim 31, wherein said tissue specific promoter is selected 
from the group consisting of the lck promoter, the myogenin promoter and the thyl 
promoter. 

33. The method of claim 1 , wherein the target cell is an embryonic cell. 

34. The method of claim 33, wherein the target cell is an embryogenic stem cell. 

35. The method of claim 33, further comprising implanting the embryo in a 
pseudopregnant female. 

36. The method of claim 1, wherein infecting a target cell comprises injecting 
the recombinant retrovirus between the zona pellucida and the cell membrane of a 
mammalian embryonic cell. 

37. The method of claim 1, wherein infecting a target cell comprises removing 
the zona pellucida from a mammalian embryonic cell and incubating the cell in solution 
containing the recombinant retrovirus. 

38. The method of claim 37, wherein the zona pellucida is removed by 
enzymatic digestion. 

39. The method of claim 1 , wherein the target cell is a cultured cell. 

40. The method of claim 39, wherein the target cell is a cultured mammalian 

cell. 

41. The method of claim 40, wherein the cultured mammalian cell is selected 
from the group consisting of CHO, HEK, COS and MEF cells. 

42. The method of claim 1 , wherein the target cell is an embryonic cell of a bird. 
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43. The method of claim 41, wherein infecting an embryonic cell of a bird 
comprises contacting the embryonic blastodisc of the bird egg with the recombinant 
retrovirus. 

44. The method of claim 1, wherein the target cell is a fish egg. 

45. The method of claim 42, wherein infecting the fish egg comprises delivering 
the recombinant retrovirus to the space between the chorion and the cell membrane of the 
fish egg. 

46. A retroviral construct for the expression of an RNA molecule within a cell, 
the retroviral construct comprising: 

a nucleic acid having the R and U5 sequences from a 5' lentiviral long terminal 
repeat (LTR); 

a self-inactivating lentiviral 3' LTR; and 
an RNA Polymerase in promoter, 

47. The retroviral construct of claim 46, further comprising at least one 
termination sequence. 

48. The retroviral construct of claim 46, wherein the RNA Polymerase III 
promoter is inducible. 

49. The retroviral construct of claim 48, wherein the inducible promoter is 
activated with tetracycline. 

50. The retroviral construct of claim 46, further comprising at least one RNA 
coding region operably linked to the RNA Polymerase III promoter. 

51. The retroviral construct of claim 50, wherein the RNA coding region 
encodes a self-complementary RNA molecule having a sense region, an antisense region 
and a loop region. 

52. The retroviral construct of claim 51, wherein the loop region is about 2 to 
about 10 nucleotides in length. 

53. The retroviral construct of claim 51, wherein the sense region and the 
antisense region are between about 15 and about 30 nucleotides in length. 

54. The retroviral construct of claim 50, wherein the RNA coding region 
encodes a first RNA molecule, and the retroviral construct further comprises a second RNA 
Polymerase HI promoter and a second RNA coding region operably linked to the second 
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RNA Polymerase III promoter, wherein the second RNA coding region encodes a second 
RNA molecule substantially complementary to the first RNA molecule. 

55. The retroviral construct of claim 50, wherein the retroviral construct further 
comprises a second RNA Polymerase in promoter operably linked to the RNA coding 
region, such that expression of the RNA coding region from the first RNA Polymerase m 
promoter results in a synthesis of a first RNA molecule and expression of the RNA coding 
region from the second RNA Polymerase HI promoter results in synthesis of a second RNA 
molecule substantially complementary to the first RNA molecule. 

56. The retroviral construct of claim 50, wherein expression of the RNA coding 
region results in the down regulation of a target gene,. 

57. The method of claim 56, wherein the target gene comprises a sequence that 
is at least about 90% identical with the RNA coding region. 

58. The retroviral construct of claim 46, wherein the 5' LTR sequences are from 

HIV. 

59. The retroviral construct of claim 46, wherein the viral construct comprises 
the woodchuck hepatitis virus enhancer element sequence. 

60. The retroviral construct of claim 46, wherein the viral construct comprises a 
tRNA amber suppressor sequence. 

61. The retroviral construct of claim 46, wherein the self-inactivating 3' LTR 
comprises a U3 element with a deletion of its enhancer sequence. 

62. The retroviral construct of claim 46, wherein the self-inactivating 3' LTR is 
a modified HIV 3' LTR. 

63. The retroviral construct of claim 46, wherein the recombinant retrovirus is 
pseudotyped. 

64. The retroviral construct of claim 63, wherein the recombinant retrovirus is 
pseudotyped with the vesicular stomatitits virus envelope glycoprotein. 

65. The retroviral construct of claim 46, wherein the viral construct further 
comprises a gene of interest. 

66. The retroviral construct of claim 65, wherein the viral construct has a 
Polymerase II promoter operably linked to the gene of interest. 

67. The retroviral construct of claim 66, wherein the RNA Polymerase II 
promoter is a CMV promoter. 
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68. The retroviral construct of claim 66, wherein the viral construct additionally 
comprises an enhancer operably linked to the RNA Polymerase II promoter. 

69. The retroviral construct of claim 68, wherein the enhancer and the RNA 
Polymerase II promoter are CMV sequences. 

70. The retroviral construct of claim 66, wherein the RNA Polymerase II 
promoter is a ubiquitous promoter. 

71. The retroviral construct of claim 70, wherein the ubiquitous promoter is 
selected from the group consisting of the ubiquitin promoter, the CMV p-actin promoter 
and the pgk promoter. 

72. The retroviral construct of claim 66, wherein the RNA Polymerase II 
promoter is a tissue specific promoter. 

73. The retroviral construct of claim 72, wherein said tissue specific promoter is 
selected from the group consisting of the lck promoter, the myogenin promoter and the thyl 
promoter. 

74. The retroviral construct of claim 65, wherein the gene of interest is a 
reporter gene. 

75. The retroviral construct of claim 74, wherein the reporter gene encodes a 
fluorescent protein. 

76. The retroviral construct of claim 75, wherein the fluorescent protein is green 
fluorescent protein. 
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FIGURE 1A 
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FIGURE 6 
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FIGURE 7 
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FIGURE 8 
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FIGURE 9 
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SEQUENCE LISTING 

<110> CALIFORNIA INSTITUTE OF TECHNOLOGY 
IN # XIAO -FENG 
LO I S - CABALLE , CARLOS 
BALTIMORE, DAVID 

<120> METHOD FOR EXPRESSION OF SMALL RNA 
MOLECULES WITHIN A CELL 

<130> CALTE.010VPC 

<150> 60/322,031 
<151> 2001-09-13 

<150> 60/347,782 
<151> 2002-01-09 

<150> 60/389,592 
<151> 2002-06-18 

<150> Not yet available (Attorney Docket No.: CALTE.011PR) 
<151> 2002-08-27 

<160> 5 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 59 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> This represents a siRNA cassette comprising 

bacterial sequence and synthetic linker, loop and 
terminator sequences. 

<400> 1 

gatccccgtg accagcgaat acctgtttca agagaacagg tattcgctgg tcacttttt 59 

<210> 2 
<211> 9941 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> This sequence represents a lentiviral vector 

comprising a human immunodeficiency virus flap 
sequence, a green fluorescent protein variant 
sequence, a human ubiquitin promoter sequence and 
a woodchuck hepatitis regulator element sequence. 



<400> 2 

gtcgacggat cgggagatct cccgatcccc tatggtgcac tctcagtaca atctgctctg 60 
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atgccgcata gttaagccag tatctgctcc 
gcgcgagcaa aatttaagct acaacaaggc 
tgcttagggt taggcgtttt gcgctgcttc 
attgattatt gactagttat taatagtaat 
atatggagtt ccgcgttaca taacttacgg 
acccccgccc attgacgtca ataatgacgt 
tccattgacg tcaatgggtg gagtatttac 
tgtatcatat gccaagtacg ccccctattg 
attatgccca gtacatgacc ttatgggact 
tcatcgctat taccatggtg atgcggtttt 
ttgactcacg gggatttcca agtctccacc 
accaaaatca acgggacttt ccaaaatgtc 
gcggtaggcg tgtacggtgg gaggtctata 
ctctggttag accagatctg agcctgggag 
aagcctcaat aaagcttgcc ttgagtgctt 
tctggtaact agagatccct cagacccttt 
gcccgaacag ggacttgaaa gcgaaaggga 
ggcttgctga agcgcgcacg gcaagaggcg 
ttttgactag cggaggctag aaggagagag 
ggagaattag atcgcgatgg gaaaaaattc 
aattaaaaca tatagtatgg gcaagcaggg 
tgttagaaac atcagaaggc tgtagacaaa 
caggatcaga agaacttaga tcattatata 
aaaggataga gataaaagac accaaggaag 
aaagtaagac caccgcacag caagcggccg 
agggacaatt ggagaagtga attatataaa 
gtagcaccca ccaaggcaaa gagaagagtg 
ggagctttgt tccttgggtt cttgggagca 
acgctgacgg tacaggccag acaattattg 
ctgagggcta ttgaggcgca acagcatctg 
ctccaggcaa gaatcctggc tgtggaaaga 
tggggttgct ctggaaaact catttgcacc 
aataaatctc tggaacagat ttggaatcac 
aacaattaca caagcttaat acactcctta 
aatgaacaag aattattgga attagataaa 
acaaattggc tgtggtatat aaaattattc 
agaatagttt ttgctgtact ttctatagtg 
tcgtttcaga cccacctccc aaccccgagg 
gaaggtggag agagagacag agacagatcc 
gcgccaattc tgcagacaaa tggcagtatt 
tggggggtac agtgcagggg aaagaatagt 
agaattacaa aaacaaatta caaaaattca 
agatccagtt tggttaatta agggtgcagc 
cgggcgcccc cctcctcacg gcgagcgctg 
ctgatccttc cgcccggacg ctcaggacag 
aaccccagta tcagcagaag gacattttag 
ttttctttcc agagagcgga acaggcgagg 
agggatctcc gtggggcggt gaacgccgat 
agctagttcc gtcgcagccg ggatttgggt 
gtcacttggt gagttgcggg ctgctgggct 
cggtgggacg gaagcgtgtg gagagaccgc 
gttgccctga actgggggtt ggggggagcg 
atggaagacg cttgtaaggc gggctgtgag 
gcggcaagaa cccaaggtct tgaggccttc 
gatgggctgg ggcaccatct ggggaccctg 
gggtttgtcg tctggttgcg ggggcggcag 
acctttggga gcgcgcgcct cgtcgtgtcg 



ctgcttgtgt gttggaggtc gctgagtagt 120 
aaggcttgac cgacaattgc atgaagaatc 180 
gcgatgtacg ggccagatat acgcgttgac 240 
caattacggg gtcattagtt catagcccat 300 
taaatggccc gcctggctga ccgcccaacg 360 
atgttcccat agtaacgcca atagggactt 420 
ggtaaactgc ccacttggca gtacatcaag 480 
acgtcaatga cggtaaatgg cccgcctggc 540 
ttcctacttg gcagtacatc tacgtattag 600 
ggcagtacat caatgggcgt ggatagcggt 660 
ccattgacgt caatgggagt ttgttttggc 720 
gtaacaactc cgccccattg acgcaaatgg 780 
taagcagcgc gttttgcctg tactgggtct 840 
ctctctggct aactagggaa cccactgctt 900 
caagtagtgt gtgcccgtct gttgtgtgac 960 
tagtcagtgt ggaaaatctc tagcagtggc 1020 
aaccagagga gctctctcga cgcaggactc 1080 
aggggcggcg actggtgagt acgccaaaaa 1140 
atgggtgcga gagcgtcagt attaagcggg 1200 
ggttaaggcc agggggaaag aaaaaatata 1260 
agctagaacg attcgcagtt aatcctggcc 1320 
tactgggaca gctacaacca tcccttcaga 1380 
atacagtagc aaccctctat tgtgtgcatc 1440 
ctttagacaa gatagaggaa gagcaaaaca 1500 
ctgatcttca gacctggagg aggagatatg 1560 
tataaagtag taaaaattga accattagga 1620 
gtgcagagag aaaaaagagc agtgggaata 1680 
gcaggaagca ctatgggcgc agcgtcaatg 1740 
tctggtatag tgcagcagca gaacaatttg 1800 
ttgcaactca cagtctgggg catcaagcag 1860 
tacctaaagg atcaacagct cctggggatt 1920 
actgctgtgc cttggaatgc tagttggagt 1980 
acgacctgga tggagtggga cagagaaatt 2040 
attgaagaat cgcaaaacca gcaagaaaag 2100 
tgggcaagtt tgtggaattg gtttaacata 2160 
ataatgatag taggaggctt ggtaggttta 2220 
aatagagtta ggcagggata ttcaccatta 2280 
ggacccgaca ggcccgaagg aatagaagaa 2340 
attcgattag tgaacggatc ggcactgcgt 24 00 
catccacaat tttaaaagaa aaggggggat 24 60 
agacataata gcaacagaca tacaaactaa 2520 
aaattttcgg gtttattaca gggacagcag 2580 
ggcctccgcg ccgggttttg gcgcctcccg 2640 
ccacgtcaga cgaagggcgc aggagcgttc 2700 
cggcccgctg ctcataagac tcggccttag 2760 
gacgggactt gggtgactct agggcactgg 2820 
aaaagtagtc ccttctcggc gattctgcgg 2880 
gattatataa ggacgcgccg ggtgtggcac 2940 
cgcggttctt gtttgtggat cgctgtgatc 3000 
ggccggggct ttcgtggccg ccgggccgct 3060 
caagggctgt agtctgggtc cgcgagcaag 3120 
cacaaaatgg cggctgttcc cgagtcttga 3180 
gtcgttgaaa caaggtgggg ggcatggtgg 3240 
gctaatgcgg gaaagctctt attcgggtga 3300 
acgtgaagtt tgtcactgac tggagaactc 3360 
ttatgcggtg ccgttgggca gtgcacccgt 3420 
tgacgtcacc cgttctgttg gcttataatg 3480 
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cagggtgggg ccacctgccg gtaggtgtgc 
gttcgggcct agggtaggct ctcctgaatc 
gataagtgag gcgtcagttt ctttggtcgg 
agctccggtt ttgaactatg cgctcggggt 
caccttttga aatgtaatca tttgggtcaa 
cttctgcagg tcgactctag aaaattgtcc 
ttagacagga tccccgggta ccggtcgcca 
ccggggtggt gcccatcctg gtcgagctgg 
tgtccggcga gggcgagggc gatgccacct 
ccaccggcaa gctgcccgtg ccctggccca 
agtgcttcag ccgctacccc gaccacatga 
ccgaaggcta cgtccaggag cgcaccatct 
gcgccgaggt gaagttcgag ggcgacaccc 
acttcaagga ggacggcaac atcctggggc 
acgtctatat catggccgac aagcagaaga 
acaacatcga ggacggcagc gtgcagctcg 
gcgacggccc cgtgctgctg cccgacaacc 
aagaccccaa cgagaagcgc gatcacatgg 
tcactctcgg catggacgag ctgtacaagt 
aagcttatcg ataatcaacc tctggattac 
aactatgttg ctccttttac gctatgtgga 
attgcttccc gtatggcttt cattttctcc 
tatgaggagt tgtggcccgt tgtcaggcaa 
gcaaccccca ctggttgggg cattgccacc 
ttccccctcc ctattgccac ggcggaactc 
ggggctcggc tgttgggcac tgacaattcc 
ccttggctgc tcgcctgtgt tgccacctgg 
ccttcggccc tcaatccagc ggaccttcct 
cttccgcgtc ttcgccttcg ccctcagacg 
catcgatacc gtcgacctcg agacctagaa 
gcagctacca atgctgattg tgcctggcta 
ccagtcacac ctcaggtacc tttaagacca 
cactttttaa aagaaaaggg gggactggaa 
atccttgatc tgtggatcta ccacacacaa 
ccagggccag ggatcagata tccactgacc 
gagcaagaga aggtagaaga agccaatgaa 
agcctgcatg ggatggatga cccggagaga 
ctagcatttc atcacatggc ccgagagctg 
gaccagatct gagcctggga gctctctggc 
taaagcttgc cttgagtgct tcaagtagtg 
tagagatccc tcagaccctt ttagtcagtg 
ccgctgatca gcctcgactg tgccttctag 
cgtgccttcc ttgaccctgg aaggtgccac 
aattgcatcg cattgtctga gtaggtgtca 
cagcaagggg gaggattggg aagacaatag 
ggcttctgag gcggaaagaa ccagctgggg 
cggcgcatta agcgcggcgg gtgtggtggt 
cgccctagcg cccgctcctt tcgctttctt 
tccccgtcaa gctctaaatc gggggctccc 
cctcgacccc aaaaaacttg attagggtga 
gacggttttt cgccctttga cgttggagtc 
aactggaaca acactcaacc ctatctcggt 
gatttcggcc tattggttaa aaaatgagct 
ctgtggaatg tgtgtcagtt agggtgtgga 
atgcaaagca tgcatctcaa ttagtcagca 
gcaggcagaa gtatgcaaag catgcatctc 
actccgccca tcccgcccct aactccgccc 



ggtaggcttt tctccgtcgc aggacgcagg 3540 
gacaggcgcc ggacctctgg tgaggggagg 3600 
ttttatgtac ctatcttctt aagtagctga 3660 
tggcgagtgt gttttgtgaa gttttttagg 3720 
tatgtaattt tcagtgttag actagtaaag 3780 
gctaaattct ggccgttttt ggcttttttg 3840 
ccatggtgag caagggcgag gagctgttca 3900 
acggcgacgt aaacggccac aagttcagcg 3960 
acggcaagct gaccctgaag ttcatctgca 4 020 
ccctcgtgac caccctgacc tacggcgtgc 4080 
agcagcacga cttcttcaag tccgccatgc 4140 
tcttcaagga cgacggcaac tacaagaccc 4200 
tggtgaaccg catcgagctg aagggcatcg 4260 
acaagctgga gtacaactac aacagccaca 4320 
acggcatcaa ggtgaacttc aagatccgcc 4380 
ccgaccacta ccagcagaac acccccatcg 4440 
actacctgag cacccagtcc gccctgagca 4500 
tcctgctgga gttcgtgacc gccgccggga 4560 
aaagcggccg cgactctaga attcgatatc 4620 
aaaatttgtg aaagattgac tggtattctt 4680 
tacgctgctt taatgccttt gtatcatgct 4740 
tccttgtata aatcctggtt gctgtctctt 4800 
cgtggcgtgg tgtgcactgt gtttgctgac 4 860 
acctgtcagc tcctttccgg gactttcgct 4 920 
atcgccgcct gccttgcccg ctgctggaca 4 980 
gtggtgttgt cggggaaatc atcgtccttt 5040 
attctgcgcg ggacgtcctt ctgctacgtc 5100 
tcccgcggcc tgctgccggc tctgcggcct 5160 
agtcggatct ccctttgggc cgcctccccg 5220 
aaacatggag caatcacaag tagcaataca 5280 
gaagcacaag aggaggagga ggtgggtttt 5340 
atgacttaca aggcagctgt agatcttagc 5400 
gggctaattc actcccaacg aagacaagat 5460 
ggctacttcc ctgattggca gaactacaca 5520 
tttggatggt gctacaagct agtaccagtt 5580 
ggagagaaca cccgcttgtt acaccctgtg 5640 
gaagtattag agtggaggtt tgacagccgc 5700 
catccggact gtactgggtc tctctggtta 5760 
taactaggga acccactgct taagcctcaa 5820 
tgtgcccgtc tgttgtgtga ctctggtaac 5880 
tggaaaatct ctagcagggc ccgtttaaac 5940 
ttgccagcca tctgttgttt gcccctcccc 6000 
tcccactgtc ctttcctaat aaaatgagga 6060 
ttctattctg gggggtgggg tggggcagga 6120 
caggcatgct ggggatgcgg tgggctctat 6180 
ctctaggggg tatccccacg cgccctgtag 624 0 
tacgcgcagc gtgaccgcta cacttgccag 6300 
cccttccttt ctcgccacgt tcgccggctt 6360 
tttagggttc cgatttagtg ctttacggca 6420 
tggttcacgt agtgggccat cgccctgata 6480 
cacgttcttt aatagtggac tcttgttcca 6540 
ctattctttt gatttataag ggattttgcc 6600 
gatttaacaa aaatttaacg cgaattaatt 6660 
aagtccccag gctccccagc aggcagaagt 6720 
accaggtgtg gaaagtcccc aggctcccca 6780 
aattagtcag caaccatagt cccgccccta 6840 
agttccgccc attctccgcc ccatggctga 6900 
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ctaatttttt ttatttatgc agaggccgag 
tagtgaggag gcttttttgg aggcctaggc 
tccattttcg gatctgatca gcacgtgttg 
tagtataata cgacaaggtg aggaactaaa 
tgctcaccgc gcgcgacgtc gccggagcgg 
cccgggactt cgtggaggac gacttcgccg 
tcagcgcggt ccaggaccag gtggtgccgg 
gcctggacga gctgtacgcc gagtggtcgg 
ccgggccggc catgaccgag atcggcgagc 
acccggccgg caactgcgtg cacttcgtgg 
atttcgattc caccgccgcc ttctatgaaa 
ccggctggat gatcctccag cgcggggatc 
tgtttattgc agcttataat ggttacaaat 
aagcattttt ttcactgcat tctagttgtg 
atgtctgtat accgtcgacc tctagctaga 
ctgtgtgaaa ttgttatccg ctcacaattc 
gtaaagcctg gggtgcctaa tgagtgagct 
ccgctttcca gtcgggaaac ctgtcgtgcc 
ggagaggcgg tttgcgtatt gggcgctctt 
cggtcgttcg gctgcggcga gcggtatcag 
cagaatcagg ggataacgca ggaaagaaca 
accgtaaaaa ggccgcgttg ctggcgtttt 
acaaaaatcg acgctcaagt cagaggtggc 
cgtttccccc tggaagctcc ctcgtgcgct 
acctgtccgc ctttctccct tcgggaagcg 
atctcagttc ggtgtaggtc gttcgctcca 
agcccgaccg ctgcgcctta tccggtaact 
acttatcgcc actggcagca gccactggta 
gtgctacaga gttcttgaag tggtggccta 
gtatctgcgc tctgctgaag ccagttacct 
gcaaacaaac caccgctggt agcggtggtt 
gaaaaaaagg atctcaagaa gatcctttga 
acgaaaactc acgttaaggg attttggtca 
tccttttaaa ttaaaaatga agttttaaat 
ctgacagtta ccaatgctta atcagtgagg 
catccatagt tgcctgactc cccgtcgtgt 
ctggccccag tgctgcaatg ataccgcgag 
caataaacca gccagccgga agggccgagc 
ccatccagtc tattaattgt tgccgggaag 
tgcgcaacgt tgttgccatt gctacaggca 
cttcattcag ctccggttcc caacgatcaa 
aaaaagcggt tagctccttc ggtcctccga 
tatcactcat ggttatggca gcactgcata 
gcttttctgt gactggtgag tactcaacca 
cgagttgctc ttgcccggcg tcaatacggg 
aagtgctcat cattggaaaa cgttcttcgg 
tgagatccag ttcgatgtaa cccactcgtg 
tcaccagcgt ttctgggtga gcaaaaacag 
gggcgacacg gaaatgttga atactcatac 
atcagggtta ttgtctcatg agcggataca 
taggggttcc gcgcacattt ccccgaaaag 
<210> 3 
<211> 233 
<212> DNA 

<213> Homo sapiens 



gccgcctctg cctctgagct attccagaag 6960 
ttttgcaaaa agctcccggg agcttgtata 7020 
acaattaatc atcggcatag tatatcggca 7080 
ccatggccaa gttgaccagt gccgttccgg 714 0 
tcgagttctg gaccgaccgg ctcgggttct 7200 
gtgtggtccg ggacgacgtg accctgttca 7260 
acaacaccct ggcctgggtg tgggtgcgcg 7320 
aggtcgtgtc cacgaacttc cgggacgcct 7380 
agccgtgggg gcgggagttc gccctgcgcg 7440 
ccgaggagca ggactgacac gtgctacgag 7500 
ggttgggctt cggaatcgtt ttccgggacg 7560 
tcatgctgga gttcttcgcc caccccaact 7620 
aaagcaatag catcacaaat ttcacaaata 7680 
gtttgtccaa actcatcaat gtatcttatc 7740 
gcttggcgta atcatggtca tagctgtttc 7800 
cacacaacat acgagccgga agcataaagt 7860 
aactcacatt aattgcgttg cgctcactgc 7920 
agctgcatta atgaatcggc caacgcgcgg 7980 
ccgcttcctc gctcactgac tcgctgcgct 8040 
ctcactcaaa ggcggtaata cggttatcca 8100 
tgtgagcaaa aggccagcaa aaggccagga 8160 
tccataggct ccgcccccct gacgagcatc 8220 
gaaacccgac aggactataa agataccagg 8280 
ctcctgttcc gaccctgccg cttaccggat 8340 
tggcgctttc tcatagctca cgctgtaggt 8400 
agctgggctg tgtgcacgaa ccccccgttc 8460 
atcgtcttga gtccaacccg gtaagacacg 8520 
acaggattag cagagcgagg tatgtaggcg 8580 
actacggcta cactagaaga acagtatttg 8640 
tcggaaaaag agttggtagc tcttgatccg 8700 
tttttgtttg caagcagcag attacgcgca 8760 
tcttttctac ggggtctgac gctcagtgga 8820 
tgagattatc aaaaaggatc ttcacctaga 8880 
caatctaaag tatatatgag taaacttggt 8940 
cacctatctc agcgatctgt ctatttcgtt 9000 
agataactac gatacgggag ggcttaccat 9060 
acccacgctc accggctcca gatttatcag 9120 
gcagaagtgg tcctgcaact ttatccgcct 9180 
ctagagtaag tagttcgcca gttaatagtt 9240 
tcgtggtgtc acgctcgtcg tttggtatgg 93 00 
ggcgagttac atgatccccc atgttgtgca 9360 
tcgttgtcag aagtaagttg gccgcagtgt 9420 
attctcttac tgtcatgcca tccgtaagat 9480 
agtcattctg agaatagtgt atgcggcgac 9540 
ataataccgc gccacatagc agaactttaa 9600 
ggcgaaaact ctcaaggatc ttaccgctgt 9660 
cacccaactg atcttcagca tcttttactt 9720 
gaaggcaaaa tgccgcaaaa aagggaataa 9780 
tcttcctttt tcaatattat tgaagcattt 9840 
tatttgaatg tatttagaaa aataaacaaa 9900 
tgccacctga c 9941 
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gaattcgaac gctgacgtca tcaacccgct ccaaggaatc gcgggcccag tgtcactagg 60 
cgggaacacc cagcgcgcgt gcgccctggc aggaagatgg ctgtgaggga caggggagtg 120 
gcgccctgca atatttgcat gtcgctatgt gttctgggaa atcaccataa acgtgaaatg 180 
tctttggatt tgggaatctt ataagttctg tatgagacca cagatctaag ctt 233 

<210> 4 
<211> 355 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> This represents a mutant human sequence having an 
introduced bacterial tetOl binding, site . 

<400> 4 

gggaattccc ccagtggaaa gacgcgcagg caaaacgcac cacgtgacgg agcgtgaccg 60 
cgcgccgagc ccaaggtcgg gcaggaagag ggcctatttc ccatgattcc ttcatatttg 120 
catatacgat acaaggctgt tagagagata attagaatta atttgactgt aaacacaaag 180 
atattagtac aaaatacgtg acgtagaaag taataatttc ttgggtagtt tgcagtttta 240 
aaattatgtt ttaaaatgga ctatcatatg cttaccgtaa cttgaaagta ctctatcatt 300 
gatagagtta tatatcttgt ggaaaggacg aaacaccgtg gtcttcaagc ttccg 355 

<210> 5 
<211> 634 
<212> DNA 
<213> E. coli 



<400> 5 

gctagccacc atgtccagat 
taatgaggtc ggaatcgaag 
gcagcctaca ttgtattggc 
tgagatgtta gataggcacc 
ttttttacgt aataacgcta 
aaaagtacat ttaggtacac 
agccttttta tgccaacaag 
ggggcatttt actttaggtt 
aagggaaaca cctactactg 
tgatcaccaa ggtgcagagc 
agaaaaacaa cttaaatgtg 



tagataaaag taaagtgatt 
gtttaacaac ccgtaaactc 
atgtaaaaaa taagcgggct 
atactcactt ttgcccttta 
aaagttttag atgtgcttta 
ggcctacaga aaaacagtat 
gtttttcact agagaatgca 
gcgtattgga agatcaagag 
atagtatgcc gccattatta 
cagccttctt attcggcctt 
aaagtgggtc ttaa 



aacagcgcat tagagctgct 60 
gcccagaagc taggtgtaga 12 0 
ttgctcgacg ccttagccat 180 
gaaggggaaa gctggcaaga 240 
ctaagtcatc gcgatggagc 30 0 
gaaactctcg aaaatcaatt 360 
ttatatgcac tcagcgctgt 420 
catcaagtcg ctaaagaaga 480 
cgacaagcta tcgaattatt 540 
gaattgatca tatgcggatt 600 

634 
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